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ABSTRACT 

We investigate the properties of the (complex) coefficients obtained in a spherical 
harmonic representation of temperature maps of the cosmic microwave background 
(CMB). We study the effect of the coefficient phase only, as well as the combined 
effects of phase and amplitude. The method used to check for anomalies is to construct 
a "random walk" trajectory in the complex plane where the step length and direction 
are given by the amplitude and phase (respectively) of the harmonic coefficient. If 
the fluctuations comprise a homogeneous and isotropic Gaussian random field on the 
sky, the path so obtained should be a classical "Rayleigh flight" with very well known 
statistical properties. We illustrate the use of this random-walk representation by 
using the net walk length as a test statistic, and apply the method to the coefficients 
obtained from a Wilkinson Microwave Anisotropy Probe (WMAP) preliminary sky 
temperature map. 
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1 INTRODUCTION 

The process of gravitational instability initiated by small 
primordial density perturbations is a vital ingredient of cos- 
mological models that attempt to explain how galaxies and 
large-scale structure formed in the Universe. In the stan- 
dard cosmological models, a period of accelerated expansion 
( "inflation" ) generated density fluctuations with simple sta- 
tistical properties through quantum processes (Starobinsky 
1979, 1980, 1982; Guth 1981; Guth & Pi 1982; Albrecht & 
Steinhardt 1982; Linde 1982). In this scenario the primordial 
density field is assumed to form a statistically homogenous 
and isotropic Gaussian Random Field (GRF). Over years 
of observational scrutiny this paradigm has strengthened its 
hold in the minds of cosmologists and has survived many 
tests, culminating in those furnished by the Wilkinson Mi- 
crowave Anisotropy Probe (WMAP; Bennett et al. 2003; 
Hinshaw et al 2003). 

Since the release of the first (preliminary) WMAP data 
set it has been subjected to a number of detailed inde- 
pendent analyses that have revealed some surprising fea- 
tures. Eriksen et al. (2004) have pointed out the existence 
of a North-South asymmetry suggesting that the cosmic mi- 
crowave background (CMB) revealed by the WMAP data 
is not statistically homogeneous over the celestial sphere. 
This is consistent with the results of Coles et al. (2004) who 
found evidence for phase correlations in the WMAP data; 



see also Hajian & Souradeep (2003) and Hajian, Souradeep 
& Cornish (2004). The low-order multipoles of the CMB also 
display some peculiarities (de Oliveira-Costa et al. 2004a; 
Efstathiou 2004). Vielva et al. (2004) found significant non- 
Gaussian behaviour in a wavelet analysis of the same data, 
as did Chiang et al. (2004), Larson & Wandelt (2004) and 
Park (2004). Other analyses of the statistical properties of 
the WMAP have yielded results consistent with the standard 
form of fluctuation statistics (Komatsu et al. 2003; Colley & 
Gott 2003). These unusual properties may well be generated 
by residual foreground contamination (Banday et al. 2003; 
Naselsky et al. 2003; de Oliveira-Costa et al. 2004; Dineen & 
Coles 2004) or other systematic effects, but may also provide 
the first hints of physics beyond the standard cosmological 
model. 

In order to tap the rich source of information provided 
by future CMB maps it is important to devise as many in- 
dependent statistical methods as possible to detect, isolate 
and diagnose the various possible causes of departures from 
standard statistics. One particularly fruitful approach is to 
look at the behaviour of the complex coefficients that arise 
in a spherical harmonic analysis of CMB maps. Chiang et 
al. (2004), Chiang, Naselsky & Coles (2004), and Coles et 
al. (2004) have focussed on the phases of these coefficients 
on the grounds that a property of a statistically homoge- 
nous and isotropic GRF is that these phases are random. 
Phases can also be use to test for the presence of primor- 
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dial magnetic fields (Chen et al. 2004; Naselsky et al. 2004) 
or evidence of non-trivial topology (Dineen, Rocha & Coles 
2004). In this paper we suggest an extension of this idea 
which involves viewing the entire set of harmonics (both 
amplitude and phase) in the complex plane. 

We describe the basic harmonic description in the next 
Section. In Section 3 we show how to represent the spherical 
harmonics as random-walks in the complex plane and give 
some simple analytic results. We apply the idea to the pre- 
liminary WMAP 1-year data in Section 4 and briefly discuss 
the results in Section 5. 



where Ci is the angular power spectrum, the subject of much 
scrutiny in the context of the cosmic microwave background 
(e.g. Hinshaw et al. 2003), and 5 xx i is the Kronecker delta 
function. Since the phases are random, the stochastic prop- 
erties of a statistically homogeneous and isotropic Gaussian 
random field are fully specified by the C; , which determines 
the variance of the real and imaginary parts of a;, m both of 
which are Gaussian: 

cr 2 (xi <m ) = a 2 (y hm ) = of = ^Ci. (8) 



3 RANDOM WALKS IN HARMONIC SPACE 



2 SPHERICAL HARMONICS AND GAUSSIAN 
FLUCTUATIONS 

We can describe the distribution of fluctuations in the mi- 
crowave background over the celestial sphere using a sum 
over a set of spherical harmonics: 
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Here A(9, <fi) is the departure of the temperature from the 
average at angular position (8, <f>) on the celestial sphere in 
some coordinate system, usually galactic. The Y, m (0, (j)) are 
spherical harmonic functions which we define in terms of the 
Legendre polynomials P !m using 



Y lm (e,4>) = ^r f 2l +^ m ™ y - P lm (cos6)e^, (2) 

i.e. we use the Condon-Shortley phase convention. In Equa- 
tion (1), the a;_ m are complex coefficients which can be writ- 
ten 



di.m = Xl,m + iyi,m = |oj,m| exp[i<^,, 



(3) 



Note that, since A is real, the definitions (2) & (3) requires 
the following relations between the real and imaginary parts 
of the cii. m : if m is odd then 



Xl,m = K(Oi, m ) = -5ft(o ( ,_ m ) = -Xl- m , 

Vl,m = ^(ai,m) = S(aj,_ m ) = Vl-m\ 

while if m is even 

xi, m = Sft(aj, m ) = K(a;,_ m ) = xi, 

yi,m = ^(ai,m) = -^{ai,-m)=yi, 

and if m is zero then 
Q(ai, m ) = yi,o = 0. 
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-mi 



(4) 



(5) 



(6) 



From this it is clear that the m = mode always has zero 
phase, and there are consequently only I independent phase 
angles describing the harmonic modes at a given I. Without 
loss of information we can therefore restrict our analysis to 
m > 0. 

If the primordial density fluctuations form a Gaussian 
random field in space the temperature variations induced 
across the sky form a Gaussian random field over the celes- 
tial sphere. This means that 



{ai,ma*',m,') = Ci8ii'8„ 



(7) 



To begin with, we concentrate on a simple measure based on 
the distribution of total displacements. Consider a particular 
value of I. The set of values {ai jm } can be thought of as steps 
in a random walk in the complex plane, a structure which 
can be easily visualized and which has well-known statistical 
properties. 

The simplest statistic one can think of to describe the 
set {ai jlrl } is the net displacement of a random walk corre- 
sponding to the spherical harmonic mode I, i.e. 



Ri = (Xt,Yi) = ^ ai, m , 



(9) 



where the vector ai, m = (xi tm , yi,m) and the random walk 
has an origin at ai t o (which is always on the x-axis). The 
length of each step a^ m = |ai, m | is the usual spherical har- 
monic coefficient described in the previous section and de- 
fined by equation (1). If the initial fluctuations are Gaussian 
then the two components of each displacement are indepen- 
dently normal with zero mean and the same variance (8). 
Each step then has a Rayleigh distribution so that the prob- 
ability density for a ijm to be in the range (a, a + da) is 



p(a) 



■ exp 



2a? 



(10) 



This is a particularly simple example of a random walk (Mc- 
Crea & Whipple 1940; Chandrasekhar 1943; Hughes 1995). 
Since the displacements in x and y are independently Gaus- 
sian the next displacement after I steps is itself Gaussian 
with variance la?. The total displacement Ri is given by 



R? = (X? + Y?) 



(11) 



The probability density of \Ri \ to be in the range (r, r + dr) 
is then itself a Rayleigh distribution of the form 



r 

la? 



Pi( r ) = — exp 



Ha? 



(12) 



The result (12) only obtains if the steps of the random 
walk are independent and Gaussian. If the distribution of 
the individual steps is non-Gaussian, but the steps are in- 
dependent, then the result (11) will be true for large I by 
virtue of the Central Limit Theorem. Exact results for fi- 
nite I for example non-Gaussian distributions are given by 
Hughes (1995). 

A slightly different approach is to keep each step length 
constant. The simplest way of doing this is to define 



Ri = 



E — 



m>0 



ai,i 



= J2 a ^ 



(13) 



m>0 
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so that each step is of unit length but in a random direction. 
This is precisely the problem posed in a famous letter by 
Pearson (1905) and answered one week later by Rayleigh 
(1905). In the limit of large numbers of steps the result maps 
into the previous result (12) with of = 1 by virtue of the 
Central Limit Theorem. For finite values of I there is also 
an exact result which can be derived in integral form using 
a method based on characteristic functions (Hughes 1995). 
The result is that the probability density for Ri to be in the 
range r,r + dr is 



<li( 



r) — r 

Jo 



uJo(ur)[Jo(u)] du. 



(14) 



The integral is only convergent for I > 2 but for 2 = 1 or 
I — 2 straightforward alternative expressions are available 
(Hughes 1995). 

One can use this distribution to test for randomness 
of the phase angles without regard to the amplitudes. To 
see how this works, consider the following simple model of 
phase correlations. Following Watts, Coles & Melott (2003) , 
Suppose that the phase difference between adjacent modes 
has a preferred angle, as modelled by the relation 



Here 
and ui 
In this model 



(15) 

= | sin#|, (ui, m ) = 
= 0, to keep the step length equal to unity. 



ai, m = ai_ ra+ i costf - ui, ra+ i. 

is a random variable with \ui 
i-ai, 



(ai, m • ai >m+n ) = (ai >m+ i ■ ai, m ) cos6> = cos" 
Hence 

(Rf) = I + 2 ^2 ( a i,m • ai >ra+n ), 



which is 



(Rf) = N + 2 cosU ~ m 9 - 



(16) 



(17) 



(18) 



(Rf) 



Using the properties of a geometric series this can be seen 
to be 

'r^l) = ^ (19) 

1 — cos a J 

compared with the simple (R t ) = I which obtains if the 
phases are random. The distribution of total displacements 
therefore encodes information about phase correlations in 
the form of a "persistence length", £. The distribution of 
Ri therefore encodes information about phase correlations: 
if there is no preferred direction, cos 9 = and the result 
reduces to the case of a purely random walk with £ = 1. Non- 
zero values of cos 6 will alter the mean square displacement 
relative to this case: it can be either larger if cos 6 is positive 
and steps tend to occur in the same direction in the complex 
plane (£ > 1), or smaller if £ < 1. 

So far we have concentrated on fixed I with a random 
walk as a function of m. We could instead have fixed m and 
considered a random walk as a function of I. Or indeed ran- 
domly selected N values of I and m. In either case the results 
above still stand except with of replaced by an average over 
all the modes considered: 



1 \ ~> 2 




(20) 
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Figure 1. The random walk performed by the spherical harmonic 
coefficients for I = 532, statistically the mode that displays the 
greatest departure from that expected under the null hypothesis. 
The outer circles correspond to 99.9, 99 and 95 per cent upper 
confidence limits s (from outer to inner); the inner circles are the 
corresponding lower limits, though the 99.9 per cent lower limit 
is too small to see. 



We do not consider this case any further in this paper. 



4 APPLICATION TO THE WMAP 1 YR DATA 

There are many possible ways of using the properties of a 
random walk to test the hypothesis that the CMB tempera- 
ture fluctuations are drawn from a statistically homogeneous 
and isotropic Gaussian random field on the sky could be fur- 
nished by comparing the empirical distribution of harmonic 
random flights with the form (12). This requires an estimate 
of of. This can either be made using the same data or by 
assuming a given form for Ci, in which case the resulting 
test would be of a composite hypothesis that the fluctua- 
tions constitute a Gaussian random field with a particular 
spectrum. For large I this is can be done straightforwardly, 
but for smaller values the sampling distribution of Ri will 
differ significantly from (12) because of the uncertainty in 
population variance from a small sample of ai m . This is the 
so-called "cosmic variance" problem. 

In practice the most convenient way to assess the signif- 
icance of departures from the relevant distribution would be 
to perform Monte Carlo experiments of the null hypothesis. 
For statistical measures more complicated than the net dis- 
placement, the best way to set up a statistical test is to use 
Monte-Carlo re-orderings of the individual steps to estab- 
lish the confidence level of any departure from Gaussianity. 
This also enables one to incorporate such complications as 
galactic cuts. 

The WMAP team released an Internal Linear Combina- 
tion (ILC) map that combined five original frequency band 
maps in such a way to maintain unit response to the CMB 
whilst minimising foreground contamination. The construc- 
tion of this map is described in detail in Bennett et al. 
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confidence level 


95 


99 


99.9 


unit length 


33 


6 


2 


variable length 


20 


4 


1 


null 


30 


6 


0.6 



Table 1. Results for random- walks with variable step- length and 
unit step-length for the WMAP ILC data. The columns show the 
number of /-modes for which the test statistic lies outside the a 
percent confidence limit for the Monte Carlo skies. The final row 
shows the number of modes that would be expected, on average, 
to lie outside this range if the null hypothesis were true. 



(2003). The weighted map is produced by minimizing the 
variance of the temperature scale such that the weights add 
to one. To further improve the result, the inner Galactic 
plane is divided into 11 separate regions and weights deter- 
mined separately. This takes account of the spatial varia- 
tions in the foreground properties. Thus, the final combined 
map does not rely on models of foreground emission and 
therefore any systematic or calibration errors of other ex- 
periments do not enter the problem. The final map covers 
the full-sky and the idea is that it should represent only the 
CMB signal. Following the release of the WMAP 1 yr data 
Tcgmark, Oliveira-Costa & Hamilton (2003; TOH) produced 
a cleaned CMB map. They argued that their version con- 
tained less contamination outside the Galactic plane com- 
pared with the ILC map produced by the WMAP team. 

The ILC map is not intended for statistical analysis but 
in any case represents a useful "straw man" for testing sta- 
tistical techniques for robustness. To this end, we analyzed 
the behaviour of the random-walks representing spherical 
harmonic from I = 1 to I = 600 in the WMAP ILC. Sim- 
ilar results are obtained for the TOH map so we do not 
discuss the TOH map here. For both variable-length (9) 
and unit-length (13) versions of the random-walk we gen- 
erated 100000 Monte Carlo skies assuming Gaussian statis- 
tics. These were used to form a distribution of |Ri| (or |Ri|) 
over the ensemble of randomly-generated skies. A rejection 
of the null hypothesis (of stationary Gaussianity) at the a 
per cent level occurs when the measured value of the test 
statistic lies outstide the range occupied by a per cent of 
the random skies. The results we obtained are summarized 
in Table 1. 

The results show that this simple test does not strongly 
falsify the null hypothesis, which is not surprising given 
the simplicity of the measure we have used. The number 
of modes outside the accepted range is close to that which 
would be expected if the null hypothesis were true. Notice 
that slightly more modes show up in the unit length case 
than in the other, perhaps indicating that the phase corre- 
lations that are known to exist in this data (Chiang et al. 
2004) are masked if amplitude information is also included. 
The most discrepant mode turns out to be I — 532 in both 
cases. For interest a plot of the random walk for this case is 
included as Figure 1. 

We can turn these results into limits on the persistence 
length: if £. If £ =fc there would be either more observed 
walks longer than the limits (£ > 1) corresponding to the 
null hypothesis or more shorter ones (£ < 1). Assuming the 



form (12), which should hold for large values of I, the current 
data place rough limits of 0.9 < £ < 1.1. 



5 DISCUSSION AND CONCLUSIONS 

We have presented a novel representation of the behaviour 
of spherical harmonic coefficients obtained in the represen- 
tation of of CMB temperature maps. Indeed, the main result 
of this paper is the construction of two-dimensional walks 
as discussed in Section 3. In order to illustrate the method 
in the simplest way we performed a simple statistical test 
on the net displacement of the random walks for different I 
modes. The results of this analysis do not provide conclusive 
evidence for departures from a homogeneous and isotropic 
GRF. For example, two modes are discrepant at 99.9 per 
cent confidence, compared with the 0.6 expected by chance. 
In a series of independent random skies with an expecta- 
tion of 0.6 modes, Poisson statistics give a probability of 88 
per cent that less than two modes would be discrepant. The 
detection of two modes is therefore not an indication of a 
strong departure from chance. 

It is worth noting that the method does not pick out 
any significant departures at low I, which is where the non- 
parametric tests of Coles et al. (2004) were applied with 
positive results. This test works better with large numbers 
of independent m modes. In the simple form presented here 
it seems less powerful than the phase-mapping technique of 
Chiang et al. (2003) and Chiang et al. (2004). Whatever 
forms of non-Gaussianity and/or non-stationarity there are 
in these data, this very simple test is not sensitive to them. 

As we explained above, however, the net displacement 
of the random walk is a simple but rather crude indica- 
tion of the properties of the {ai >m }, as it does not take into 
account the ordering of the individual steps. The possible 
non-Gaussian behaviour of the set {ai jm } is encoded not 
so much in the net displacement but in the shape of the 
random walk. To put this another way, there are many pos- 
sible paths with the same net displacement, and these will 
have different shapes depending on the correlations between 
step size and direction. Long runs of directed steps or reg- 
ular features in the observed structure could be manifesta- 
tions of phase correlation (Coles et al. 2004). For example, 
the presence of a primordial magnetic field contribution to 
the CMB fluctuations would result in a correlation between 
ai-i, m and ai + i m at fixed m (Chen et al. 2004). Examples 
of how similar effects might be produced by foregrounds, 
galactic cuts and so on are discussed by Naselsky et al. 
(2005). These phase correlations would manifest themselves 
in higher-order statistics of the random walk as a function 
of I. It will also be useful, say, to combine random walks as 
functions of m for different I: correlations between the net 
displacements of such walks would indicate the presence of 
phase correlations too; see Coles et al. (2004). The graphi- 
cal representation of the set {ai jm } in the form illustrated 
by Figure 1 provides an elegant way of visualizing the be- 
haviour of the harmonic modes and identifying any oddities. 
These could be quantified using a variety of statistical shape 
measures: moment of inertia (Rudnick, Beldjenna & Gas- 
pari 1987), fractal dimension, first-passage statistics, shape 
statistics (e.g. Kuhn & Uson 1982), or any of the methods 
use to quantify the shape of minimal spanning trees (Bar- 
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row, Bhavsar & Sonoda 1985). It would be interesting to try 
these more sophisticated measures on subsequent releases of 
the WMAP data. 
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